Skip to content

Conversation

@DaanHoogland
Copy link
Contributor

Description

This PR...

Fixes: #9962

Types of changes

  • Breaking change (fix or feature that would cause existing functionality to change)
  • New feature (non-breaking change which adds functionality)
  • Bug fix (non-breaking change which fixes an issue)
  • Enhancement (improves an existing feature and functionality)
  • Cleanup (Code refactoring and cleanup, that may add test cases)
  • build/CI
  • test (unit or integration test code)

Feature/Enhancement Scale or Bug Severity

Feature/Enhancement Scale

  • Major
  • Minor

Bug Severity

  • BLOCKER
  • Critical
  • Major
  • Minor
  • Trivial

Screenshots (if appropriate):

How Has This Been Tested?

How did you try to break this feature and the system with this change?

@weizhouapache
Copy link
Member

@DaanHoogland
target to 4.19/4.20 or main ?

@codecov
Copy link

codecov bot commented Dec 3, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 15.13%. Comparing base (47f6019) to head (0bfaef3).
Report is 1 commits behind head on 4.19.

Additional details and impacted files
@@            Coverage Diff            @@
##               4.19   #10028   +/-   ##
=========================================
  Coverage     15.13%   15.13%           
- Complexity    11261    11263    +2     
=========================================
  Files          5408     5408           
  Lines        473842   473842           
  Branches      57771    57771           
=========================================
+ Hits          71696    71697    +1     
  Misses       394145   394145           
+ Partials       8001     8000    -1     
Flag Coverage Δ
uitests 4.30% <ø> (ø)
unittests 15.85% <ø> (+<0.01%) ⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

@github-actions
Copy link

github-actions bot commented Dec 3, 2024

This pull request has merge conflicts. Dear author, please fix the conflicts and sync your branch with the base branch.

@DaanHoogland DaanHoogland changed the base branch from main to 4.19 December 3, 2024 11:40
Copy link
Member

@weizhouapache weizhouapache left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

code lgtm

@DaanHoogland DaanHoogland marked this pull request as ready for review December 3, 2024 14:49
@DaanHoogland
Copy link
Contributor Author

@blueorangutan package

@blueorangutan
Copy link

@DaanHoogland a [SL] Jenkins job has been kicked to build packages. It will be bundled with KVM, XenServer and VMware SystemVM templates. I'll keep you posted as I make progress.

@blueorangutan
Copy link

Packaging result [SF]: ✔️ el8 ✔️ el9 ✔️ debian ✔️ suse15. SL-JID 11695

@DaanHoogland
Copy link
Contributor Author

@blueorangutan test

@blueorangutan
Copy link

@DaanHoogland a [SL] Trillian-Jenkins test job (ol8 mgmt + kvm-ol8) has been kicked to run smoke tests

@DaanHoogland DaanHoogland changed the title Remove SNI option that is correct as default in _run.sh Remove SNI option, as it is correct as default, in _run.sh Dec 4, 2024
@DaanHoogland DaanHoogland changed the title Remove SNI option, as it is correct as default, in _run.sh Remove SNI option in _run.sh, as it is correct as default. Dec 4, 2024
@blueorangutan
Copy link

[SF] Trillian test result (tid-11840)
Environment: kvm-ol8 (x2), Advanced Networking with Mgmt server ol8
Total time taken: 52094 seconds
Marvin logs: https://github.com/blueorangutan/acs-prs/releases/download/trillian/pr10028-t11840-kvm-ol8.zip
Smoke tests completed. 129 look OK, 4 have errors, 0 did not run
Only failed and skipped tests results shown below:

Test Result Time (s) Test File
test_02_list_cpvm_vm Failure 0.05 test_ssvm.py
test_04_cpvm_internals Failure 0.05 test_ssvm.py
test_01_secure_vm_migration Error 134.41 test_vm_life_cycle.py
test_01_secure_vm_migration Error 134.41 test_vm_life_cycle.py
test_06_download_detached_volume Failure 436.36 test_volumes.py
test_01_redundant_vpc_site2site_vpn Failure 421.60 test_vpc_vpn.py

@DaanHoogland
Copy link
Contributor Author

[SF] Trillian test result (tid-11840) Environment: kvm-ol8 (x2), Advanced Networking with Mgmt server ol8 Total time taken: 52094 seconds Marvin logs: https://github.com/blueorangutan/acs-prs/releases/download/trillian/pr10028-t11840-kvm-ol8.zip Smoke tests completed. 129 look OK, 4 have errors, 0 did not run Only failed and skipped tests results shown below:
Test Result Time (s) Test File
test_02_list_cpvm_vm Failure 0.05 test_ssvm.py
test_04_cpvm_internals Failure 0.05 test_ssvm.py
test_01_secure_vm_migration Error 134.41 test_vm_life_cycle.py
test_01_secure_vm_migration Error 134.41 test_vm_life_cycle.py
test_06_download_detached_volume Failure 436.36 test_volumes.py
test_01_redundant_vpc_site2site_vpn Failure 421.60 test_vpc_vpn.py

some of these errors seem completely unrelated and some might be, re-running for comparison

@DaanHoogland DaanHoogland force-pushed the 9962-ssvm-unable-to-find-valid-certification-path-to-requested-target branch from 969205a to 0bfaef3 Compare December 5, 2024 08:35
@DaanHoogland
Copy link
Contributor Author

@blueorangutan package

@blueorangutan
Copy link

@DaanHoogland a [SL] Jenkins job has been kicked to build packages. It will be bundled with KVM, XenServer and VMware SystemVM templates. I'll keep you posted as I make progress.

@sonarqubecloud
Copy link

sonarqubecloud bot commented Dec 5, 2024

@blueorangutan
Copy link

Packaging result [SF]: ✔️ el8 ✔️ el9 ✔️ debian ✔️ suse15. SL-JID 11720

@DaanHoogland
Copy link
Contributor Author

@blueorangutan test

@blueorangutan
Copy link

@DaanHoogland a [SL] Trillian-Jenkins test job (ol8 mgmt + kvm-ol8) has been kicked to run smoke tests

@blueorangutan
Copy link

[SF] Trillian test result (tid-11854)
Environment: kvm-ol8 (x2), Advanced Networking with Mgmt server ol8
Total time taken: 45381 seconds
Marvin logs: https://github.com/blueorangutan/acs-prs/releases/download/trillian/pr10028-t11854-kvm-ol8.zip
Smoke tests completed. 132 look OK, 1 have errors, 0 did not run
Only failed and skipped tests results shown below:

Test Result Time (s) Test File
test_01_secure_vm_migration Error 133.29 test_vm_life_cycle.py
test_01_secure_vm_migration Error 133.29 test_vm_life_cycle.py

@DaanHoogland
Copy link
Contributor Author

test_01_secure_vm_migration fails as "Unable to deploy the VM as the host: ol8.localdomain is not in the right state" is consistent, however I am not sure if it is related as I have seen this error in other PRs as well ... needs investigation.

@weizhouapache
Copy link
Member

test_01_secure_vm_migration fails as "Unable to deploy the VM as the host: ol8.localdomain is not in the right state" is consistent, however I am not sure if it is related as I have seen this error in other PRs as well ... needs investigation.

@DaanHoogland
seems unrelated.

I recall we have faced the issue before

2024-12-05 19:22:41,560 DEBUG [c.c.h.Status] (AgentConnectTaskPool-13:ctx-1c122273) (logid:0523be08) Unable to update host for event:Ready. Name=ol8.localdomain; New=[status=Up:msid=null:lastpinged=1692799264]; Old=[status=Connecting:msid=null:lastpinged=1692799264]; DB=[status=Connecting:msid=32987513095075:lastpinged=1692799259:old update count=45]
2024-12-05 19:22:41,561 INFO  [c.c.u.e.CSExceptionErrorCode] (AgentConnectTaskPool-15:ctx-b814e2d3) (logid:b85f4adb) Could not find exception: com.cloud.exception.ConnectionException in error code list for exceptions
2024-12-05 19:22:41,561 DEBUG [c.c.a.m.ClusteredAgentManagerImpl] (AgentTaskPool-9:ctx-d2c55a62) (logid:315eb3a2) Notifying other nodes of to disconnect
2024-12-05 19:22:41,562 DEBUG [c.c.a.m.AgentManagerImpl] (AgentConnectTaskPool-15:ctx-b814e2d3) (logid:b85f4adb) Failed to handle host connection:
com.cloud.exception.ConnectionException: Unable to acquire lock on host 1922ce9f-4d1d-453e-bf1f-4676592379a4
    at com.cloud.agent.manager.AgentManagerImpl.sendReadyAndGetAttache(AgentManagerImpl.java:1155)
    at com.cloud.agent.manager.AgentManagerImpl.handleConnectedAgent(AgentManagerImpl.java:1168)
    at com.cloud.agent.manager.AgentManagerImpl$HandleAgentConnectTask.runInContext(AgentManagerImpl.java:1252)
    at org.apache.cloudstack.managed.context.ManagedContextRunnable$1.run(ManagedContextRunnable.java:48)
    at org.apache.cloudstack.managed.context.impl.DefaultManagedContext$1.call(DefaultManagedContext.java:55)
    at org.apache.cloudstack.managed.context.impl.DefaultManagedContext.callWithContext(DefaultManagedContext.java:102)
    at org.apache.cloudstack.managed.context.impl.DefaultManagedContext.runWithContext(DefaultManagedContext.java:52)
    at org.apache.cloudstack.managed.context.ManagedContextRunnable.run(ManagedContextRunnable.java:45)
    at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
    at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
    at java.base/java.lang.Thread.run(Thread.java:829)

cc @vishesh92

@DaanHoogland
Copy link
Contributor Author

ok @weizhouapache @vishesh92 do we merge this and create a new issue for this?

test_01_secure_vm_migration fails as "Unable to deploy the VM as the host: ol8.localdomain is not in the right state" is consistent, however I am not sure if it is related as I have seen this error in other PRs as well ... needs investigation.

@DaanHoogland seems unrelated.

I recall we have faced the issue before

2024-12-05 19:22:41,560 DEBUG [c.c.h.Status] (AgentConnectTaskPool-13:ctx-1c122273) (logid:0523be08) Unable to update host for event:Ready. Name=ol8.localdomain; New=[status=Up:msid=null:lastpinged=1692799264]; Old=[status=Connecting:msid=null:lastpinged=1692799264]; DB=[status=Connecting:msid=32987513095075:lastpinged=1692799259:old update count=45]
2024-12-05 19:22:41,561 INFO  [c.c.u.e.CSExceptionErrorCode] (AgentConnectTaskPool-15:ctx-b814e2d3) (logid:b85f4adb) Could not find exception: com.cloud.exception.ConnectionException in error code list for exceptions
2024-12-05 19:22:41,561 DEBUG [c.c.a.m.ClusteredAgentManagerImpl] (AgentTaskPool-9:ctx-d2c55a62) (logid:315eb3a2) Notifying other nodes of to disconnect
2024-12-05 19:22:41,562 DEBUG [c.c.a.m.AgentManagerImpl] (AgentConnectTaskPool-15:ctx-b814e2d3) (logid:b85f4adb) Failed to handle host connection:
com.cloud.exception.ConnectionException: Unable to acquire lock on host 1922ce9f-4d1d-453e-bf1f-4676592379a4
    at com.cloud.agent.manager.AgentManagerImpl.sendReadyAndGetAttache(AgentManagerImpl.java:1155)
    at com.cloud.agent.manager.AgentManagerImpl.handleConnectedAgent(AgentManagerImpl.java:1168)
    at com.cloud.agent.manager.AgentManagerImpl$HandleAgentConnectTask.runInContext(AgentManagerImpl.java:1252)
    at org.apache.cloudstack.managed.context.ManagedContextRunnable$1.run(ManagedContextRunnable.java:48)
    at org.apache.cloudstack.managed.context.impl.DefaultManagedContext$1.call(DefaultManagedContext.java:55)
    at org.apache.cloudstack.managed.context.impl.DefaultManagedContext.callWithContext(DefaultManagedContext.java:102)
    at org.apache.cloudstack.managed.context.impl.DefaultManagedContext.runWithContext(DefaultManagedContext.java:52)
    at org.apache.cloudstack.managed.context.ManagedContextRunnable.run(ManagedContextRunnable.java:45)
    at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
    at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
    at java.base/java.lang.Thread.run(Thread.java:829)

cc @vishesh92

@weizhouapache
Copy link
Member

ok @weizhouapache @vishesh92 do we merge this and create a new issue for this?

I think we can merge

@DaanHoogland DaanHoogland merged commit 971a5b2 into 4.19 Dec 6, 2024
49 of 50 checks passed
@DaanHoogland DaanHoogland deleted the 9962-ssvm-unable-to-find-valid-certification-path-to-requested-target branch December 6, 2024 13:46
@DaanHoogland
Copy link
Contributor Author

thanks @weizhouapache merged. I will keep an I on the HealthCheck PR for this one.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

ssvm: unable to find valid certification path to requested target

4 participants